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(57) Abstract 

Hie process according to the present invention allows expression and isolation of polypeptides with the proteolytic activity of HCV 
NS3 protease m a pure, catalyticaHy active form, and in amounts that are sufficient for discovery of NS3 protease inhibitors and for 
aetermmtion of the uiree^mensional structure of the NS3 protease. A further subject of the present invention is a procedure that defines 
foe chemical and physical conditions necessary for completion of the proteolytic activity of the above polypeptides. Tht invention further 
compmes new composmons of matter (expression vectors) containing nucleotide sequences capable of expressing the LTm JZnri 
r^lypeptdes ,n culture cells. Finally, new compounds of matter are defined, suitable to measure the above proteolytic activity, and useful 
to develop NS3 protege inhibitors and therefore therapeutic agents for use against HCV. The figure shows the kinetic param^of HCV 
NS3 protease using the S3 depsipeptide substrate (SEQ ID NO: 45). 
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METHODOLOGY TO PRODUCE, PURIFY AND ASSAY POLYPEPTIDES 

WITH THE PROTEOLYTIC ACTIVITY OF THE HCV NS3 PROTEASE 

DRSrRTPTTDM 

The present invention relates to molecular biology 
and to hepatitis C virus (HCV) virology. More 
specifically, the invention has as its subject a process 
for producing, in a pure form and in high quantities, 
polypeptides having the proteolytic activity of HCV NS3 
protease, and a method for the effective reproduction in 
vitro of the proteolytic activity of these polypeptides 
in order to define an enzymatic assay capable of 
selecting, for therapeutic purposes, compounds inhibiting 
the enzyme activity associated with NS3 . 

As is known, the hepatitis C virus (HCV) is the main 
etiological agent of non-A, hon-B hepatitis (NANB) . It is 
estimated that HCV causes at least 90% of post- 
transfusional NANB viral hepatitis and 50% of sporadic 
NANB hepatitis. Although great progress has been made in 
the selection of blood donors and in the immunological 
characterisation of blood used for transfusions, there is 
still a high number of HCV infections among recipients of 
blood transfusions (one million or more infections every 
year throughout the world) . Approximately 50% of HCV- 
infected individuals develop liver cirrhosis within a 
period that can range from 5 to 40 years. Furthermore, 
recent clinical studies suggest that there is a 
correlation between chronic HCV infection and the 
development of hepatocellular carcinoma. 

HCV is an enveloped virus containing an RNA positive 
genome of approximately 9.4 kb. This virus is a member of 
the Flaviviridae family, the other members of which are 
the flavi viruses and the pestiviruses . 

The RNA genome of HCV has recently been mapped. 
Comparison of sequences from the HCV genomes isolated in 
various parts of the-world has shown that these sequences 
can be extremely heterogeneous. The majority of the HCV 
genome is occupied by an open reading frame (ORF) that 
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can vary between 9030 and 9099 nucleotides. This ORF 
codes for a single viral polyprotein, the length of which 
can vary from 3010 to 3033 amino acids. During the viral 
infection cycle, the polyprotein is proteolytically 
processed into the individual gene products necessary for 
replication of the virus. 

The genes coding for HCV structural proteins are 
located at the 5' -end of the ORF, whereas the region 
coding for the non- structural proteins occupies the rest 
of the ORF. 

The structural proteins consist of C (core, 21 kDa) 
El (envelope, gp37) and E2 (NS1, gp61) . C is a non- 
glycosylated protein of 21 kDa which probably forms the 
viral nucleocapsid. The protein El is a glycoprotein of 
approximately 37 kDa, which is believed to be a 
structural protein for the outer viral envelope. E2, 
another membrane glycoprotein of 61 kDa, is probably a 
second structural protein in the outer envelope of the 
virus. 

The non- structural region starts with N§2 (p24) , a 
hydrophobic protein of 24 kDa whose function is unknown. 

NS3, a protein of 68 kDa which follows NS2 in the 
polyprotein, is predicted to have two functional domains: 
a serine protease domain within the first 200 araino- 
terminal amino acids, and an RNA- dependent ATPase domain 
at the carboxy terminus. 

The NS4 gene region codes for NS4A (p6) and NS4B 
(p26) , two hydrophobic proteins of 6 and 26 kDa, 
respectively, whose functions have not yet been fully 
clarified. 

The NS5 gene region also codes for two proteins, 
NS5A (p56) and NS5B (p65) , of 56 and 65 kDa, 
respectively. Amino acid sequences present in all the 
RNA-dependent RNA polymerases can be recognised within 
the NS5 region. This suggests that the NS5 region 
contains components of the viral replication machinery. 
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Various molecular biological studies indicate that 
the signal peptidase, a protease associated with the 
endoplasmic reticulum of the host cell, is responsible 
for proteolytic processing in the non- structural region, 
that is to say at sites C/El, E1/E2 and E2/NS2. 

The serine protease in NS3 is responsible for 
cleavage at the junctions between NS3 and NS4A, between 
NS4A and NS4B, between NS4B and NS5A and between NS5A and 
NS5B. In particular it has been found that the cleavage 
made by this serine protease leaves a cysteine or a 
treonine residue on the amino-terminal side (position Pi) 
and an alanine or serine residue on the carboxy- terminal 
side (position PI') of the cleavage site. It has been 
shown that the protease contained in NS3 is a 
heterodimeric protein in vivo, forming a complex with the 
protein NS4A. Formation of this complex increases 
proteolytic activity on sites NS4A/NS4B and NS5A/NS5B, 
and is a necessary requisite for proteolytic processing 
of site NS4B/NS5A. 

A second protease activity of HCV appears to be 
responsible for the cleavage between NS2 and NS3. This 
protease activity is contained in a region comprising 
both part of NS2 and the portion of NS3 containing the 
serine protease domain, but does not use the same 
catalytic mechanism as the latter. 

A substance capable of interfering with the 
proteolytic activity associated with the protein NS3 
might constitute a new therapeutic agent. In effect, 
inhibition of this protease activity would involve 
stopping the proteolytic processing of the non- structural 
region of the HCV polyprotein and, consequently, would 
prevent viral replication of the infected cells. 

This sequence of events has been verified for the 
homologous flavivirus, which, unlike HCV, infects cell 
line cultures. In this case, it has been shown that 
genetic manipulations involving generation of a protease 



WO 97/08304 



- 4 - 



PCT/IT96/00163 



no longer capable of carrying out its catalytic activity, 
abolishes the ability of the virus to replicate (1) . 

Furthermore, it has been widely shown, both in vitro 
and in clinical studies, that compounds capable of 
interfering with the HIV protease activity are capable of 
inhibiting replication of this virus (2) . 

The methods used to generate molecules with 
therapeutic potential are known to . those operating in 
this field. Generally speaking, collections of compounds 
containing a large number of single chemical entities 
with a high molecular diversity are made to undergo an 
automatised assay in order to identify single active 
agents, which then undergo further chemical modifications 
in order to improve their therapeutic potential. Other 
approaches may include rational modification of 
substrates or ligands of specific target protein, with 
the aim of developing high binding affinity compounds 
capable of altering or abolishing the biological activity 
of the protein under examination. Determination of the 
three-dimensional. structure of a target protein, by means 
of methods known in the sector as X-ray crystallography 
or nuclear magnetic resonance (NMR) allows rational 
design of molecules capable of binding specifically to 
the protein and which, as a result of this, have the 
ability to interfere with the biological properties of 
that protein. 

Research on compounds capable of interfering with 
the biological activity of the protease contained in the 
hepatitis C virus NS3 protein is hampered by the 
difficulty in producing sufficient amounts of purified 
protein with unaltered catalytic properties, and by the 
need to use co-factors to enhance the activity of the 
enzyme in vitro. 

There is therefore a need in the specific field for 
a process to produce NS3 , or similar products , in larger 
amounts that has been possible in the past, and with an 
in vitro activity sufficient to select inhibitors. 
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The present invention consists of isolated and 
purified polypeptides, with the proteolytic activity of 
the HCV protein NS3, characterised by the fact that they 
have an amino acid sequence chosen from among the 
sequences SEQ ID NO:l, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID 
N0:4 and SEQ ID NO:5. 

The invention also comprises expression vectors - to 
produce the polypeptides represented by sequences SEQ ID 
NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID N0:4 and SEQ ID 
NO: 5 which have the proteolytic activity of HCV NS3 - 
comprising: 

- a polynucleotide coding for one of said 
polypeptides; 

functional regulation, transcription and 
translation sequences in said host cell, operatively 
bonded to said polynucleotide coding far one of said 
polypeptides ; and 

optionally, a selectable marker. 
The invention also extends to a host cell, either 
eukaryotic or prokaryotic, transformed using an 
expression vector containing a DNA sequence coding for 
SEQ ID NO:l, SEQ ID N0:2, SEQ ID NO:3, SEQ ID NO:4 and 
SEQ ID NO: 5 in such a way as to allow said host cell to 
express the specific coded polypeptide in the chosen 
sequence. The invention further comprises a process for 
preparation of polypeptides with sequence selected from 
the group comprising SEQ ID NO:l, SEQ ID NO: 2, SEQ ID 
NO: 3, SEQ ID NO: 4 and SEQ id NO: 5, characterised by the 
fact that it comprises, in combination, the following 
operations : 

- transformation of a host cell, either eukaryotic 
or prokaryotic, using one of the expression vectors 
mentioned above; and 

- expression of the desired nucleotide sequence to 
produce the chosen polypeptide; and 

- purification of the polypeptide thus obtained, 
avoiding resolubilisation protocols. 
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The present invention also has as its object a 
method for reproducing in vitro the proteolytic activity 
of the HCV NS3 protease, characterised by the fact that 
the activity of purified polypeptides, with sequences 
chosen from the group comprising SEQ ID NO:l, SEQ ID 
NO:2, SEQ ID NO:3, SEQ ID NO:4 and SEQ ID NO:S, similar 
to NS3, is reproduced in a solution containing 30-70 nM 
Tris pH 6.5-8.5, 3-30 mM dithiotreitol (DTT) , 0.5-3% 3- 
[ (3-colammide-propyl) -dimethyl -ammonium] -l- 
propansulphonate (CHAPS) and 30-70% glycerol at 
temperatures of between 20 and 25°C and by the fact that 
in these conditions the activity of the above mentioned 
polypeptides can be kinetically determined and quantified 
on peptide substrates even in the absence of co-factors. 

An assay of the protease activity of the 
polypeptides SEQ ID NO:l, SEQ ID NO:2, SEQ ID N0:3, SEQ 

ID NO: 4 and SEQ ID NO: 5 can be performed by cleaving a 
substrate providing detectable products. The cleavage is 
preferably detected using methods based on radioactive, 
colorimetric or fluorimetric signals. Methods such as 
HPLC and the like are also suitable . According to the 
present invention, the substrates used are synthetic 
peptides corresponding to the HCV polyprotein NS4A/4B 
junction. If necessary, peptides containing the amino 
acid sequence SEQ ID NO : S , or parts thereof, can be used 
as co-factor of the NS3 protease. 

Peptides suitable for use as substrates are the 
peptide represented by the sequence SEQ ID NO: 7 and 
derivatives thereof with N and/or C- terminal deletions 
(SEQ ID NOS:8-l2, 14, 18-20) and the peptide represented 
by the sequence SEQ ID NO: 47. Particularly suitable are 
the decapeptides represented by the sequences SEQ ID 
NOS: 18 -20, especially SEQ IS NO: 18 and the sequences 
derived therefrom SEQ ID NOS: 29-32, 35. 

These peptides can be used for a high -throughput 
assay of NS3 protease activity at a concentration of the 
latter of between 100-200 nM. 
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According to the invention depsipeptide substrates 
(peptides with at least one ester bond in the sequences) 
can also be. used advantageously for a high- throughput 
assay of the activity of the NS3 protease. It is, in 
fact, known that it is desirable to run the assay at the 
lowest possible enzyme concentration compatible with 
sufficient substrate conversion. This maximises 
sensitivity to inhibition and allows to screen for 
inhibitors which are present at very low concentrations 
in compound mixtures or combinatorial libraries. 
Substrates for NS3 protease with a standard amide at the 
scissile bond between residues PI and Pi" have K^^/YL 
values between 30-100 M" 1 s' 1 . This sets a practical rwge 
of enzyme concentration for a high- throughput assay of 
100-200 nM. To lower this concentration it is necessary 
to use substrates with higher K^/K,, values. Substrates 
containing an ester bond between PI and Pi" are ideally 
suited for this, since formation of the acyl-enzyme 
intermediate is accomplished much more readily due to the 
more thermodynamically favourable transesterification 
reaction (8) . The depsipeptide substrates according to 
the invention have very high ^/K^ values, and this 
brings the useful range of NS3 concentration in the high- 
throughput assay to 0.5-2 nM. These substrates may be 
synthesised in high yield on solid-phase by standard 
chemical methodology. 

Conventional assays are suitable for high throughput 
screening, but they require hydrolysis of at least 10% of 
the substrate before the product can be detected 
conveniently. This precludes determination of true 
initial rates, which are important for accurate kinetic 
studies. To overcome these difficulties, an assay has 
been developed that allows continuous monitoring of 
protease activity. The assay relies on specially tailored 
synthetic substrates, which are capable of direct-, 
continuous signal generation that is directly 
proportional to the extent of substrate hydrolysis, thus 
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avoiding the need for separation of the substrate from 
the reaction product. The depsipeptides used (SEQ ID 
NOS: 45 and 46), the chemical formulas of which are given 
in figure 12, are internally quenched fluorogenic 
substrates based on resonance energy transfer (RET) . They 
contain a fluorescent donor, 5- [(2'- 

aminoethyl) amino] naphthalenesulfonic acid (EDANS) , near 
one end of the peptide, and an acceptor group, 4-[[4>- 
(dimethylamino) phenyl] azo] benzoic acid (DABCYL) near the 
other end. The fluorescence of this type of substrate is 
initially quenched by intramolecular RET between the 
donor and the acceptor, but as the enzyme cleaves the 
substrate the fluorescence increases. EDANS and DABCYL 
were selected as donor/acceptor pair because of the 
excellent spectral overlap between the fluorescent 
emission of the former and the absorption of the latter 
(13-17) . ret efficiency depends on the distance between 
the donor and the acceptor, i.e. the closer the two, the 
higher the quenching. For the EDANS /DABCYL couple, the 
Forster distance for 50% energy-transfer (R„) is 33 A. 
The maximum distance between EDANS/DABCYL reported in a 
substrate is 11 amino acids (19) which, assuming an 
extended conformation for the peptide, corresponds to 
R-39.8 A, with a calculated RET efficiency of 24.5%. This 
corresponds to a 10 -fold increase in fluorescence upon 
substrate cleavage. 

Up to this point a general description has been 
given of the present invention. With the aid of the 
following examples , a more detailed description of 
specific embodiments thereof will now be given, in order 
to give a better understanding of the aims, 
characteristics, advantages and operation methods of the 
invention. 

Figure 1 shows the plasmid vector used for transfer 
and expression of the polypeptide represented by SEQ ID 
NO:l in Spodoptera frugiperda clone 9 cells. 
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Figures 2A and 2B show the plasmid vectors for 
transfer and expression in E. coli of the polypeptides 
represented by sequences SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID 
NO: 4 and SEQ ID NO: 5, respectively. 

Figure 3 shows NS3 activity as a function of the 
concentration of glycerol. 

Figure 4 shows NS3 activity as a function of the 
concentration of CHAPS, 3- [ (3 -colaimnide -propyl) -dimethyl - 
ammonium] -1-propansulphonate. 

Figure 5 shows NS3 activity as a function of pH. 

Figure 6 shows NS3 activity as a function of ionic 
strength. 

Figure 7 shows a diagram of the enzymatic assay to 
measure NS3 activity using as a substrate a peptide Ac- 

Asp-Glu-Met-Glu-Glu-Cys-Ala-Ser-His-Leu-Pro-Tyr-Lys-e- 
(3 H )-AC (SEQ ID N0:47) . 

Figure- 8 shows the reaction diagram for synthesis of 
the depsipeptide substrate SI represented by. the sequence 
SEQ ID NO: 4.2. 

Figure 9 shows the reaction diagram for synthesis of 
the depsipeptide substrate S2 represented by the sequence 
SEQ ID NO: 43. 

Figure 10 shows the reaction diagram for synthesis 
of the radioactive depsipeptide substrate SI represented 
by the sequence SEQ ID NO: 44. 

Figure 11 shows a high- throughput assay, based on 
radioactive signals, to determine NS3 protease activity. 

Figure 12 shows the chemical formula of the 
depsipeptide substrates (SEQ id NO: 45 and SEQ ID NO: 46) 
for a continuous assay of NS3 activity based on RET 
intramolecular fluorescence quenching. 

Figure 13 shows the reaction diagram for synthesis 
of the depsipeptide substrate S3 (SEQ ID NO:45) . 

Figures 14A and 14B show, respectively, the kinetic 
parameters for the NS3 protease with the substrate S3 
(SEQ ID NO: 45) and fluorescence as a function of time in 
the relevant assay. 
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Systems for expression of foreign genes in insect 
cultured cells, such as Spodoptera frugiperda clone 9 
(Sf9) cells infected with baculovirus vectors are known 
in the art (3) . Heterologous genes are usually placed 
under the control of the strong polyhedrin promoter of 
the Autographa californica nuclear polyhedrosis virus or 
the Bombix jnori nuclear polyhedrosis virus. Methods for 
the introduction of heterologous DNA in the desired site 
in the baculoviral vectors by homologous recombination 
are also known in the art (4) . 

The plasmid vector pBacNS3 (1039-1226) is a 
derivative of pBlueBacIII (Invitrogen) and was 
constructed for transfer of a gene coding for a 
polypeptide with the activity of NS3 (1039-1226). For 
this purpose, the nucleotide sequence coding for this 
polypeptide described in SEQ ID NO:l was obtained by PCR 
• using oligonucleotides that insert an ATG condon at 5 • 
and a TAG stop codon at 3 • in the sequence. The fragment 
obtained in this way was inserted at the BaraHl site of 
the vector pBlueBacIII, following treatment with the 
Klenow DNA polymerase fragment. The plasmid is 
illustrated in figure 1. 

Spodoptera frugiperda clone 9 (sf9) cells and 
baculovirus recombination kits were purchased from 
Invitrogen. Cells were grown on dishes or in suspension 
at 27«C in complete Grace's insect medium (Gibco) 
containing 10% foetal bovine serum (Gibco) . Transf ection, 
recombination, and selection of baculovirus constructs 
were performed as recommended by the manufacturer. 

For protein expression, Sf 9 cells were infected with 
the recombinant baculovirus at a density of 2 x 10* cells 
per ml in a ratio of about 5 virus particles per cell. 
The cells were cultivated in suspension for 72 hours at 
23 »c. Lowering the temperature from 27°C, which 
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corresponds normally to the optimal growth temperature, 
to 23 oc is crucial in order to obtain a soluble and 
active protein. 

After harvesting the cells by centrifugation and 
washing them with PBS (20 mM sodium phosphate pH 7.4, 140 
mM NaCl) the pellet was re -suspended in 25 mM sodium 
phosphate pH 6.5, 20% glycerol, 0.5% 3- [ (3-colammide- 
propyl) -dimethyl-ammonium] -1-propansulphonate (CHAPS) , 10 
mM dithiothreitol (DTT) , l mM ethylendiammino-tetracetic 
acid (EDTA) . The cells were destroyed at 4«C by means of 
four cycles of sonication at 10 W with a duration of 30 
seconds each, using a Branson 250 instrument. The 
homogenate obtained in this way was pelleted by 
centrifugation at 120,000 x g for one hour and the 
supernatant was loaded onto an HR26/10 S-Sepharose column 
(Pharmacia) balanced with 25 mM sodium phosphate pH 6.5, 
10% glycerol, 2 mN DTT, 1 mM EDTA, 0.1% CHAPS at a flow 
rate of 2 ml/min. After washing with two volumes of 
column the protease was eluted with an NaCl gradient 
between 0 and 1 M. The fractions_containing the protease 
were identified using Western blotting methodology with 
NS3- specific polyclonal antibodies, concentrated to 3 ml 
using an Amicon ultrafiltration cell equipped with a YmlO 
membrane and chromatographed onto a Superdex 75 HR26/60 
column (Pharmacia) equilibrated with 50 mM sodium 
phosphate pH 7.5, 10% glycerol, 2 mM DTT, 0.1% CHAPS, 1 
mM ESTA and a flow rate of l ml/min. The fractions 
containing the protease were pooled and underwent further 
chromatography on a Mono-S HR5/5 column (Pharmacia) 
equilibrated with the same buffer used in the previous 
column. The protease was eluted in a pure form from this 
column, applying a linear NaCl gradient between o and 0.5 
M. The protease was stored at -80°C in 50% glycerol, 0.5% 
CHAPS, 10 mM DTT and 50 mM sodium phosphate pH 7.5. The 
yield of the process is 0.5 mg/1 of cells. The purified 
protein has a catalytic activity K^/K^^O^OO M-l s-l 
measured in 50 mM Tris pH 7.5, 50% glycerol, 2% CHAPS, 30 
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mM DTT at 23 °C using the peptide substrate Fmoc-Tyr-Gln- 

Glu-Phe-Asp-Glu-Met-Glu-Glu-Cys-Ala-Ser-His-Leu-Pro-Tyr- 
Ile-Glu-Gln-Gly (SEQ ID NO: 7), derived from the 
polyprotein cleavage site between NS4A and NS4B. The 
cleavage products deriving from this reaction were 
separated using HPLC, isolated and identified by mass 
spectrometry, confirming that proteolytic cleavage took 
place between cysteine and alanine. The concentration of 
protease necessary to determine activity was between 100 
nM and 1.6 uM. 




The plasmids pT7-7(NS3 1039-1226), pT7-7 (NS3 1039- 
1206), pT7-7 (NS3 1027-1206) andpT7-7 (NS3 1033-1206), 
described in figures 2A and 2B , were constructed in order 
to allow expression- in E. coli of the polypeptides 
indicated in SEQ ID NO: 2 and SEQ ID NO: 3, and SEQ ID NO: 4 
and SEQ ID NO: 5, respectively. The protein fragments 
contain variants of the protease domain of the HCV NS3 
protein. The respective fragments of HCV cDNA were cloned 
downstream of the bacteriophage T7 0io promoter and in 
frame with the first ATG codon of the phage T7 gene 10 
protein, using methods that are known to the practice. 
The pT7-7 plasmids containing NS3 sequences also contains 
the gene for the P-lactatnase enzyme that can be used as a 
marker of a selection of E. coli cells transformed with 
these plasmids. 

The plasmids were then transformed in the E. coli 
strain BL21 (DE53) , which is normally employed for high- 
level expression of genes cloned into expression vectors 
containing the T7 promoter. In this strain of E. coli, 
the T7 polymerase gene is carried on the bacteriophage \ 
DE53, which is integrated into the chromosome of BL21 
cells (5) . Expression from the gene of interest is 
induced by addition of isopropylthiogalactoside (IPTG) to 
the growth medium according to a procedure that has been 
previously described (5). Over 90% of the proteins 



WO 97/08304 



- 13 - 



PCT/IT96/00163 



expressed using one of the plasmids mentioned above is 
found in an insoluble form in inclusion bodies, from 
which it is possible to obtain a soluble and active 
protein following refolding methods known to the field 
(see for example (6)). Refolding protocols have often 
variable yields of catalytically active protein, and they 
require extremely controlled conditions, or cause 
irreversible modifications of the protein (such as 
carbamylation in the presence of urea), or require 
impractical procedures, such as the use of extremely 
diluted protein solutions, or dialysis of exceedingly 
large volumes of samples. 

To avoid these problems, a method has been 
developed, which is described below, for the production 
of the HCV protease in a soluble and active form, 
avoiding thus resolubilisation protocols: E . coli BL21 
(DE53) transformed using one of the plasmids mentioned 
above were grown at 37°c until reaching a cell density 
that causes absorption of 0.8 OD (OD stands for optical 
density) at 600 nm. At this point the temperature was 
lowered to 30°C in 15-20 minutes and 400 uM IPTG was 
added to induce expression of the protein. The 
temperature was then lowered further to 22-24°C within a 
period of 20-30 minutes. The cultures were stirred for a 
further 4 hours at this temperature . . At this point the 
cells were harvested by centrifugation and washed using 



PBS. 



The pellets resulting from the operations described 
above were incubated on ice for 5 minutes and re- 
suspended in 25 mM sodium phosphate pH 6.5, 50% glycerol, 
0.5% CHAPS, 10 mM DTT, 1 mM EDTA (buffer A) pre-cooled to 
4°C. 10 ml of this buffer was used for each litre of 
bacterial culture. After a further 5-10 minutes of 
incubation on ice the cell suspension was homogenised 
using a French press. The resulting homogenate was 
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centrifuged at 120,000 x g. The aupernatants from this 
cent rifugat ion were preserved on ice, whereas the pellets 
were re-suspended in buffer A (1 ml to each litre of 
bacteria culture) . After the addition of 1 mM MgCl2 and 
DNasel, the suspension was incubated for 10 minutes at 
20°C and re-centrifuged for l hour at 120,000 x g. The 
supernatant from this second centrifugation was pooled 
with the first supernatant and. the resulting protein 
solution was adsorbed on S-Sepharose (or SP-Sepharose) 
resin (Pharmacia) equilibrated with 25 mM sodium 
phosphate pH 6.5, 10% glycerol, 0.5% CHAPS, 3 mM DTT, 1 
mM EDTA (buffer B) . io ml of resin suspended in S ml of 
buffer B was used for each litre of bacterial culture. 
The resin was stirred for 1 hour at 4°C, collected by 
filtration, washed with buffer B and poured into an 
appropriate chromatography column. The protease was 
eluted with an NaCl gradient between 0 and 1 M. Fractions 
containing the protease were identified using Western 
blotting, pooled and concentrated using Centriprep 10 
concentrators (Amicon) until reaching a concentration of 
6-10 mg/ml in protein, determined using the BIORAD 
method. Up to 3 ml of this solution was loaded onto a HR 
26/60 Superdex 75 or up to 20 ml was loaded onto an HR 
60/600 Superdex 75 (both Pharmacia) equilibrated with 50 
mM sodium phosphate pH 7.5, 10% glycerol, 3 mM DTT, 0.5% 
CHAPS (buffer C) and chromatography was carried out at 1 
ml/min (HR26/60) or 5 ml/min (HR60/600) . The fractions 
containing the protease were pooled and further purified 
by chromatography on HR 5/5 Mono S (Pharmacia) 
equilibrated with buffer C. The protease was eluted from 
this column with an NaCl gradient between 0 and 0.5 M. 
Purification to homogeneity was also possible with the 
following modification: after elution from S-Sepharose 
the fractions containing the protease were diluted 1:4 in 
buffer C and loaded onto Heparin-Sepharese . Elution from 
this resin was obtained with an NaCl gradient between 0 
and 0.5 M. The protein was then chromatographed on 
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hydroxiapatite or Superdex 75 as described above. The 
yield is 1-2 mg of purified protein per litre of 
bacterial culture. 



The purified protein was characterised by means of 
gel filtration, reverse-phase HPLC, mass spectrometry and 
N- terminal sequence analysis. 

Analytical gel filtration experiments showed that 
the protein is monomeric. The protein expressed using 
pT7-7 (NS3 1027-1206) shows three peaks following 
reverse-phase HPLC chromatography. Mass spectrometry 
analysis and determination of the N- terminal sequence 
showed heterogeneity of the N- terminal portion of the 
molecule. Three forms were found, having the following N- 
terminal sequences : 

Met-Ala-Pro-Ile-Thr-Ala-Tyr-Ser-Gln-Gln-Thr (form 1) 
Pro-Ile-Thr-Ala-Tyr-Ser-Gln-Gln-Thr (form 2) 

Ser-Gln-Gln-Thr (form 3) 
To avoid this problem, two experimental strategies 
were adopted: 

1. Homogenisation in the presence of 100 fig/ml of the 
chymostatin protease inhibitor. This inhibitor does not 
inhibit HCV protease activity, but it does inhibit the 
chymotrypsin type proteases, specific for aromatic 
residues like phenylalanine and tyrosine. In this way it 
was possible to purify a single molecular species with 
more than 95% of form 2. 

2. Production of a protease corresponding to form 3 by 
means of the plasmid pT7-7 (NS3 1033-1206) . in this way a 
protein with more than 95% of form 3 was purified. 

example; ? 

Methfld for rem-oriuHna in iHt-m t-ho ar H vihy n f th* wrv 

35 Dfifinifion of rtif rheminal and nhvsi^i ^rfirin.= 

for rpnrodncr i on nf t-h*> ^t-i v -i*- y 
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The ability of the purified protease to catalyse 
cleavage of the peptide Fmoc-Tyr-Gln-Glu-Phe-Asp-Glu-Met- 
Glu-Glu-Cys-Ala-Ser-His-Leu-Pro-Tyr-Ile-Glu-Gln-Gly (SEQ 
ID NO: 7) has been used to define the optimum conditions 
for activity. Cleavage was detected by separating the 
substrate from the hydrolysis products by reverse -phase 
HPLC. For this purpose the mixture containing the buffer 
and the peptide incubated with the protease was injected 
into a reverse-phase Lichrospher RP-ia column (Merck) and 
eluted with an acetonitrile gradient containing 0.1% 
trifluoracetic acid. The cleavage products were 
identified by co-injection of appropriate standards, and 
by mass spectrometry. For these experiments, proteins 
produced by one of the methods described in examples 1 
and 2 were used. 

Dependence of the activity on the glycerol 
concentration was determined in a buffer containing SO mM 
Tris P H 7.5, 2% CHAPS, 30 mM DTT. Increasing 
concentrations of glycerol were added to this buffer, and 
the relative protease activity was determined^ Figure 3 
shows the results of this experiment, indicating that 
50% (v/v) glycerol is the optimum level. In a subsequent 
experiment this concentration was kept constant at 50% 
and the concentration of CHAPS was varied (figure 4) . A 
level of 2% CHAPS (w/v) was in this way found to be the 
optimum concentration. It was possible to replace CHAPS 
with other detergents compatible with the need to 
maintain catalytic activity in the polypeptides according 
to the invention. Some of these detergents are: heptyl-P- 
D-glucopyranoside , decyl-p-D-glucopyranoside, decyl-0-D- 
glucomaltoside, nonyl-p-D-glucopyranoside, N-hexyl-p-D- 
glucopyranoside, octyl-p-D-glucopyranoside, octyl-p-D- 
thio-glucopyranoside, Nonidet P-40, TweeN-20. 

At optimum CHAPS and glycerol concentrations the 
protease shows optimal activity at pH 8.5 (figure 5) . At 
this pH the stability over time is, however, lower than 
that seen at pH 7.5. To determine the effect of ionic 
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strength on the activity, a titration was performed using 
NaCl. This experiment showed that protease activity is 
inhibited at a high ionic strength (figure 9) . Kinetic 
analysis of data showed that chloride ions are 
competitive inhibitors at concentrations of up to 100 mM. 

It was thus possible to define the following optimal 
conditions for in vitro assay of purified HCV protease 
activity: 50 mM Tris pH 7.5, 3-30 mM DTT, 2% CHAPS, 50% 
glycerol. Dependence of the activity on temperature was 
analysed by means of an Arrhenius plot in which the 
logarithm of the kinetic constant is given as an 

inverse function of temperature. This graph shows 
discontinuity at temperatures above 25 °C, indicating 
changes in conformation simultaneously to the decrease in 
activity. The optimum temperature was thus determined to 
be around 22-23°C. 

As mentioned above, the protein NS4A is a cof actor 
of HCV protease. N and C- terminal deletion experiments 
have defined the peptide Pep4A with the sequence 
indicated in SEQ ID NO: 6, as the minimum domain still 
capable of inducing optimal activation. In transfection 
or in vitro translation experiments the addition of 
polypeptides containing the minimum NS4A sequence is 
essential to give effective cleavage. The addition of 
Pep4A is capable of inducing a significant increase in 
the activity of purified protease in the assay conditions 
described above. The kinetic characteristics of this 
activation are described below. Using a titration 
experiment a stoichiometry of l:l was determined for this 
30 interaction at a concentration of 300 nM. of protease, 
indicating a Kd<300 nM. 

Def i n i tion of r . hff ppf . imal substr-at-A for ar-Mv^y 

assay 

35 To define the minimum substrate whose cleavage can 

still be detected using the HPLC method described above, 
derivatives of the peptide Fmoc-Tyr-Gln-Glu-Phe-Asp-Glu- 
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Met-Glu-Glu-Cys-Ala-Ser-His-Leu-Pro-Tyr-Ile-Glu-Gln-Gly 
(SEQ ID NO: 7) described above were synthetized, with N- 
and/or C-terminal deletions. These peptides were 
incubated in the conditions defined in the preceding 
chapter in the presence of 100 nM-1.6 protease. The 
nomenclature for the amino acid residues of the peptides 
used as substrates that is adopted in the following is 
that set down by Schechter and Berger in (7). The 

residues are defined as Pn P3, P2, Pi, pi', p 2 ', 

P3' Pn', where the hydrolysed bond is Pi-Pi' (bond 

between Cys and Ala) . Table 1 shows the kinetic data for 
this experiment, defining P6 and P3' or P4' as the 
extreme limits of a substrate that is still effectively 
cleaved. Deletions beyond P6 or P3 • cause a drastic 
decrease in effectiveness, measured as k^/K,, with which 
the respective peptide can still act as a substrate. 
Deletion of P4 1 causes a less marked decrease of k^/K,,, 
however the separation of substrate and cleavage product 
by HPLC is significantly better for a decapeptide P6-P4' 
than for a nonapeptide P6-P3 • , so that the decapeptide 
PS-P4 ' has been defined the optimal substrate. 

Table 1: Characterisation of substrate 



Peptide 








(HM) 


(min-1) (M-Vh 


(SEQ ID NO: 7) Fmoc-YQEFDEMEECASHLPYIEQG 


53 


0.5 


143.0 


(SEQ ID NO: 8) Ac-YQEFDEMEECASHLPY 


56 


0.3 


87.0 


(SEQ ID NO: 9) Ac-YQEFDEMEECASHLP 


95 


0.4 


702 


(SEQ ID NO:10) Ac-YQEFDEMEECASHL 


117 


0.4 


51.0 


(SEQ ID NO:l 1) Ac-YQEFDEMEECASH 


197 


0.3 


24.0 


(SEQ ID NO:12) Ac-YQEFDEMEECAS 


>1500 




11.1 


(SEQ ID NO: 13) Ac-YQEFDEMEECA 




no cleavage 




(SEQ ID NO: 14) Ac-DEMEECASHLPY 


171 


0.3 


34.0 


(SEQ ID NO: 15) Ac-EMEECASHLP 


3137 


0.3 


2.0 


(SEQ ID NO: 16) Ac-MEECASHL 




no cleavage 




(SEQ ID NO:17) Ac-ECASHLPYDEQG 




no cleavage 




(SEQ ID NO: 18) Ac-DEMEECASHL 


100 


0.3 


47 
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15 



22.7 
23.8 



(SEQ ID NO: 1 9) DEMEECASHL 85 0. 1 

(SEQ ID NO:20) Fmoc-DEMEECASHL 95 0. 1 

The kinetic parameters K., k cat and ^/K. were 
determined for decapeptides P6-P4- corresponding to the 
other two intermolecular cleavage sites NS4B/5A and 
NS5A/SB and this data was compared with the data obtained 
using the peptide P6-P4' corresponding to the site 
NS4A/4B (table 2) . These kinetics were obtained both in 
the absence and in the presence of stechiometric 
concentrations of Pep4A. Analysis of the kinetic data 
obtained in this fashion indicates that Pep4A prevalently 
affects k cat . When the K,„ values for the single substrates 
are compared it becomes evident that the presence of two 
negative charges in P5 and in PS determined the bonding 
effectiveness of a peptide substrate. in fact 
decapeptides corresponding to the sites NS4A/4B and 
NS5A/5B with Asp or Glu residues in position P6 and P5 
have K,, values similar and significantly lower than the 
peptide corresponding to site NS4B/5A with a single 
charge in position PS. _ 

TABLE 2: Activity on peptides corresponding to cleavage 
20 sites in trans 
Peptide 

NS4A/4B 

(SEQ ID NO: 1 8) Ac-DEMEECASHL 
(SEQ ID NO: 6) + pep4A 

NS4B/5A 

(SEQ ID NO:21) Ac-DCSTPCSGSW 
(SEQ ID NO: 6) +pep4A 









(HM) 


(min"l) 


(M-U- 1 ) 


100 


0.3 


47.0 


43 


1.4 


540 


2100 


0.05 


0.4 


320 


0.8 


4.2 



NS5A/NS5B 

(SEQ ID NO:22) Ac-ED WCCSMSY 
(SEQ ID NO: 6) +pep4A 



310 
380 



4.2 

15 



220 

650 
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Further investigation was carried out on the 
relative importance of single residues within the 
sequence P6-P4-, corresponding to the cleavage site 
NS4A/4B, by mutating each amino acid singly to alanine 
and then determining the kinetic parameters for the 
mutant peptides obtained in this way. The results are 
described in table 3. This experiment identifies the 
following scale of importance of single residues for 
effective cleavage: P1»P3-P5-P6>P2-P4. Modification of 
the P' part does not have a significant effect on the 
rate of cleavage. This information was used to develop 
protease activity assay methods, useful for the 
identification of inhibitors. These methods will be 
described below. 

TABLE 3. Replacement with alanine of residues P6-P4* of 
the peptide substrate 



Peptide 






WK,„ 


* 


(uM) 


(min-1) 




(SEQ ID NO: 1 8) Ac-DEMEECASHL 


100 


0.3 


47.0 


(SEQ ID NO:23) Ac-AEMEECASHL 


150 


0.1 


9.4 


(SEQ ID NO:24) Ac-DAMEECASHL 


527 


03 


93 


(SEQ ID NO:25) Ac-DEAEECASHL 


114 


0.1 


18.1 


(SEQ ID NO:26) Ac-DEMAECASHL 


322 


0.1 


7.2 


(SEQ ID NO:27) Ac-DEMEACASHL 


132 


0.1 


18.4 


(SEQ ID N02 8) Ac-DEMEEAASHL 




no cleavage 




(SEQ ID NO:29) Ac-DEMEECAAHL 


129 


02 


32.5 


(SEQ ID NO:30) Ac-DEMEECASAL 


180 


0.3 


33.4 


(SEQ ID NO:3 1) Ac-DEMEECASHA 


94 


0.1 


23.2 



For more detailed determination of the importance of 
the residues in P6 and Pl>, a series of peptides P6-P4- 
were synthetised in which modifications were introduced 
in these positions. The results of these experiments are 
described in table 4. The results of these experiments 
underline the importance of a negative charge in position 
P6. In fact, Asp or Glu in this position are accepted 
with indistinguishable K„. Neutralisation of the charge 



K,„ 






(HM) 


(min-1) 




100 


0.3 


47.0 


85 


02 


32.0 


427 


02 


7.7 


>1000 




3.1 






27.2 






1.1 
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by introduction of Asn causes a significant increase in 
K., whereas inversion of the charge by introduction of a 
Lys residue causes an extremely marked increase in K„. 
TABLE 4. Substitution of residues PS and Pi' in the 
peptide substrate 

Peptide 

(SEQ ID NO: 18) Ac-DEMEECASHL 
(SEQ ID NO:32) Ac-EEMEECASHL 
(SEQ ID NO:33) Ac-NEMEECASHL 
(SEQ ID NO:34) Ac-KEMEECASHL 

(SEQ ID NO:35) Ac-DEMEECSSHL 
(SEQ ID NO:36) Ac-DEMEECFSHL 

Substitution of Ala in position Pi • with Ser has no 
significant effect, whereas substitution with Phe causes 
a reduction in the cleavage rate of the resulting 
substrate, measured as k^/ic,,. 

Analysis was carried out on a series of mutations of 
the position Pi, described in table 5. Substitution of 
cysteine in this position with threonine, alylglycine, a- 
aminobutyric acid, norvaline. and valine are accepted, 
even though the resulting substrates are cleaved with an 
efficiency, expressed as Wk,,, which is significantly 
lower than that of the unmodified substrate. 
TABLE 5. Substitution of the peptide substrate residue PI 
Peptide substrate WKm 

(SEQ ID NO:I8) Ac-DEMEECASHL 47.0 

(SEQ ID NO:37) Ac-DEMEEAlgASHL 43 

(SEQ ID NO:38) Ac-DEMEEAbuASHL 1 2 

(SEQ ID N039) Ac-DEMEETASHL q.6 

(SEQ ID NO:40) Ac-DEMEENvaASHL q.08 

(SEQ ID NO:41) Ac-DEMEEVASHL o.05 
Alg, alylglycine; Abu, a-aminobutyrric acid; Nva, 
norvaline 
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The information relating to substrate specificity 
can be used both for development of enzyme assays and for 
synthesis of inhibitors based on modified substrate 
sequences. For example, substrate peptides with modified 
PI residues are competitive inhibitors of protease with 
inhibition constants Ki of between 350 and 90 um (table 
6) . These peptides can be further modified to increase 
their inhibitory power by introduction of aldehyde, 
trifluoromethylketone, dif luoromethylenketone , diketone, 
ketoester, ketoamide or a-ketoheterocyclic, boronic acid 
and monoalomethylketone groups. Information on 
specificity can also allow synthesis of inhibitors that 
are not based on peptides, such as: halo-enolactones, 
isocoumarines , p-lactames, succinimides , pyrones, 
bezoxyazynones , bezoiso-thiazolines or latent 
isocyanates^ 

TABLE 6. Inhibitory action of decapeptides P6-P4' 
modified at position PI 



residue PI 


Ki 


Km 




(HM) 


(uM) 


Cys 




90 


Abu 


175 


189 


Alg 


165 


179 


Thr 


215 


180 


Val 


173 


not determined 


Ala 


173 


no cleavage 


Ser 


90 


no cleavage 


Gly 


191 


no cleavage 


Pro 


440 


no cleavage 


Cha 


350 


no cleavage 


a-aminobutyric acid; 


Alg, 


alylglycine; Cha, 



ciclohexylalanin . 

EXAMPT.P A 
Method for „ q 4 r « 

Automatic assay using an amide substrate 
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The peptide Ac-Asp-Glu-Met-Glu-Glu-Cys-Ala-Ser-His- 
Leu-Pro-Tyr-Lys-e-( 3 H)Ac, (SEQ ID NO:47) derived from the 
cleavage site NS4A/NS4B, is cleaved by the NS3 protease 
with the following kinetic parameters: K. = 79 uM, k^,. = 
0.49 min" 1 and k cat /K„ = 103 M* 1 s" 1 . 400,000 cpm of "the 
labelled peptide with a specific activity of 2-10 
Ci/mmol. were incubated for 3 hours at 23 °C together with 
40 uM (K„/2) of unlabeled peptide in the presence of 200 
nM protease and 1 uM of Pep4A in 50 mM Tris pH 7.5, 50% 
glycerol, 3% CHAPS, 10 mM DTT. During this period 20% of 
the peptide substrate was cleaved. The cleavage product 
can be quantified following the method described below 
and summarised in figure 7. As can be seen from the 
figure, the mixture is placed in contact with a TSK-DEAE 
anionic exchanger. The fraction coming out of the 
exchanger is filtered, allowed to sediment or spun. The 
radioactivity is measured on the clear fraction, the 
amount of which is exclusively related to the right 
fragment (C- terminal) , given that the amide substrate and 
20 _ the left hand fragment remain bound to the anionic 
exchanger. The addition of inhibitors causes a decrease 
in the release rate of the labelled cleaved fragment. The 
more effective the inhibitor, the lower will be the 
radioactivity measured in the fraction coming out of the 
25 anionic exchanger. 

EXAMPT.K q 

Synthesis of the rimqinp^^a wh^M-nr* B1 . 

Tvr-Lvfl(N fi -Ar)-NH T fSBo'Tn wry fl ) 

The synthesis was performed entirely on solid-phase 
using the continuous -flow Fmoc-polyamide method (9) . The 
protecting group combination was: base-labile No-Fmoc for 
the a-ainino group and acid-labile protection for the 
side-chains: Asp(Ot-Bu), Glu(Ot-Bu), Tyr(t-Bu) and 
His(trt). The polymer used was composite Kieselguhr- 
polyamide (9) derivatised with a modified Rink amide 
linker (10) , p- [ (R, S) -a- [l- (9H-Fluoren-9-yl) - 
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methoxyformamido] -2 , 4-dimethoxybenzyl] -phenoxyacetic acid 
(11) (Novasyn ® KR 125, 0.1 mmol/g) . The resin, amino 
acid derivatives, activating agents and all other 
reagents were of the highest available grade from 
commercial sources. The synthesis was run according to 
the scheme given in figure 8. Couplings were performed 
with 5- fold excess of activated amino acid over the resin 
free amino groups, using Fmoc-amino acid/PyBOP /HOBt /DIEA 
(1:1:1:2) activation, except for L-{+) -lactic acid where 
Fmoc-amino acid/DIPC/HOBt (1:1:1:1) activation was used. 
Esterification of Abu to the free hydroxyl of lactic acid 
was performed using the symmetrical anhydride (Fmoc- 
Abu)20 in the presence of a catalytic amount (0.1 equiv.) 
of DMAP, for 30 minutes at room temperature (12) : the 
reaction was repeated twice to achieve 90% yield; in the 
absence of catalyst, the remaining free hydroxyls are 
unreactive in subsequent synthetic operations. At the end 
of the assembly, the resin was washed with DMF, methanol 
and CH 2 C1 2 , then dried in vacuo for 16 hours . The dry 
peptide-resin was treated with TFA/water/ 
triisopropylsilane (92.5:5:2.5) for 1.5 hours at room 
temperature; the resin was filtered out and the peptide 
precipitated with cold'methyl t-Bu ether; the precipitate 
was redissolved in 50% water/acetonitrile containing 0.1% 
TFA and lyophilised. 

Purification to >98% homogeneity was achieved 
through preparative HPLC on a Nucleosyl C-18 column 
(250x21 mm, 7 \M) using as eluents (A) water and (B) 
acetonitrile with 0.1% TFA, and a step gradient 22%B over 
5 minutes, then 22-27%B over 25 minutes, flow rate 12 
ml/min. In these conditions the peptide elutes at 21.9 
minutes. The fractions containing the pure material were 
pooled and lyophilised: yield 35%. 

EXAMPLE 

Chemical Rvnthpnis of t-he depg^pp M H^ cm^^at-^ 

Tvr-Lvs (T^-Ac) -NH 7 (SEP Tn tjh.^) 
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The synthesis was performed as described in the 
previous example. Esterif ication of Thr to lactic acid 
required three repetitions to obtain a 70% yield, which 
was also accompanied by 3% raceraization of the Thr 
residue. The D-Thr diastereoisomer was however 
chromatographically well resolved from the L- isomer, and 
easily resolved by preparative HPLC. The gradient used 
was 21%B over 5 minutes, then 21-22%B over 20 minutes, 
with the desired peptide eluting at 19.7 minutes: yield 

24%. 

EXAMPLE 7 




35 



TVr-Lvs (N 6 - f*Hl -CK a CO) -Wff, (5EQ TP NO * dd. } 

To selectively label peptide SI on the if-amino 
group of the C- terminal lysine, the protected precursor 
Ac-Asp (Ot-Bu) -Glu(Ot-Bu) -Met-Glu (Ot-Bu) -Glu (Ot-Bu) -Abu- 
y [COO] -Ala-Ser(t-Bu) -His(Trt) -Leu-Pro-Tyr (t-Bu) -Lys-CONH 2 
was assembled on the resin according to the scheme of 
figure 10. The only variation with respect to the 
synthesis of (N*-Ac) -SI was the use of Fmoc-Lys (Alloc) -OH 
instead of Fmoc-Lys (tf-Ac) -OH. The Alloc protection is 
orthogonal with respect to Fmoc and t-Bu based protecting 
groups, being removed with a two hour treatment with (0) 
PdP[(Ph 3 ) 4 ] in a solution of CHC1 3 containing 5% acetic 
acid and 2.5% N-methylmorpholine . 

The dry peptide-resin (0.07 mmol/g, 60 mg) was 
reacted with [ 3 H] acetic anhydride (25 mCi, 5.7 mCi/mmol> 
for 16 hours at room temperature. A 10 -fold excess of 
non- radioactive acetic anhydride was then used to 
complete the reaction. The resin was then washed with DMF 
and treated as previously described. After preparative 
HPLC, >98% pure peptide Ac-Asp-Glu-Met-Glu-Glu-Abu-\|/- 
[COO] -Ala-Ser-His-Leu-Pro-Tyr-Lys (if- £ 3 H] -CH 3 CO) -NH 2 was 
obtained with a specific activity of 0.68 mCi/mmol. 
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Using the HPLC-based assay, the following kinetic 
parameters were obtained for the radioactive depsipeptide 
substrate SI (SEQ ID NO: 44) : 

Ke at (min" 1 ) =9 

KJuM) = 11 

Keat/^WV 1 ) - 13.636 

Using the same assay, the kinetic parameters for the 
radioactive substrate S2 are 
K^,. (min" 1 ) =16 
K^uM) = 96 
K^/ICtM-V 1 ) = 2.780. 

Synthesis of the radioactive depsipeptide substrates 
allows set-up of a high- throughput assay for 
determination of NS3 protease activity as schematically 
illustrated in figure 11. The principle is the following: 
both the intact substrate and the N-terrainal fragment 
that originates from enzyme cleavage (Ac-Asp-Glu-Met-Glu- 
Glu-Abu-OH) are extremely acid, whereas the C-terminal 
fragment [HO-CH (CH 3 ) CO-Ser-His-Leu-Pro-Tyr-Lys (N*- [ 3 H] - 
CH 3 CO)-NH 2 ] is, according to pH, neutral or basic. It is 
therefore possible to capture the two acidic species on 
an anionic exchange resin, leaving the C-terminal 
fragment in solution, if the C-terminal fragment contains 
a radioactive marker (in this case the tritiated acetate 
covalently bonded to the B-amino group of the C-terminal 
lysine) , the resin will be able to discriminate processed 
substrate from non-processed substrate, thus making it 
possible to quantify proteolytic activity by measuring 
the amount of radioactivity remaining in solution after 
incubation with the enzyme and treatment with the ion 
exchanger. The whole process is essentially the same used 
in the high- throughput assay based on the amide substrate 
of example 4 , but the pH used in this case is 7.0 instead 
of 7.5 to minimise spontaneous hydrolysis of the ester 
bond (0.6%/hour at 23 °C) . 

EXAMPT.R B 

Synthesis of the depsipeptide substrates S3 and S4 : 
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Ac-Asp-Glu-Asp- (EDANS) -Glu-Glu-Abu-V|/ [COO} -Ala-Ser-Lys- 
( DABCYL) NH 2 (SEQ ID NO:45) and Ac -Asp -Asp- (EDANS) -Met- 
Glu-Glu-Abu-y[COO}-Ala-Ser-Lys (DABCYL) NH 2 (SEQ ID NO:46) 

The chemical formula of the two substrates S3 and S4 
is shown in figure 11 . 

The synthesis was performed on solid phase as 
detailed in the scheme of figure 13 for S3 (SEQ ID 
NO: 45), making use of two special derivatives, Fraoc- 
Asp (EDANS) -OH and Fmoc-Lys (DABCYL) -OH, prepared according 
to known methods (16-17). All the couplings, including 
Asp (EDANS) and Lys (DABCYL), were performed with 5-fold 
excess of activated amino acid over the resin free amino 
groups, using Fmoc-amino acid/PyBOP/HOBt/DIEA (1:1:1:2) 
activation, with the exception of L-(+) -lactic acid where 
Fmoc-amino acid/DIPC/HOBt (1:1:1.1) activation was used. 
Esterif ication of Abu to the free hydroxyl of lactic acid 
was performed using the symmetrical anhydride (Fmoc- 
Abu) 2 0 in the presence of a catalytic amount (0-1 equiv.) 
of DMAP, for 30 minutes at room temperature (12) : the 
reaction was repeated twice to achieve 92% yield. At the 
end of the assembly, the peptide-resin was washed and the 
peptide cleaved as described for substrate SI. 

Purification to >98% homogeneity was achieved 
through preparative HPLC on a Nucleosyl C-18 column 
(250x21 mm, 7nm) using as eluents (A) 50 mM ammonium 
acetate, pH 6 and (B) acetonitrile . The gradient used for 
both S3 and S4 was 20%B over 5 minutes, then 20-40%B over 
20 minutes, flow rate 20 ml/min; the fractions containing 
the pure material were pooled and lyophilised: yield 45% 
and 35% for S3 and S4, respectively. The kinetic . 
parameters for this substrate, evaluated through the 
HPLC-based assay (see figure 14A) , were the following: 

K cat (min" 1 )=3.51 

K.jtuM) = 10.95 

Keat/XtM'V 1 ) = 5342. 

The buffer used for the assay is the following: 33 
mM DTT, 50 mM Tris, pH 7, 50% glycerol, 2% CHAPS. The 
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incubation is carried out at p H 7.0 to minimise 

spontaneous hydrolysis of the ester bond. The assay can 
be run in a cuvette or in a (96-well) microtitre plate, 
monitoring the fluorescence as a function of time 

(Excitation wavelength 355 nM, Emission wavelength 495 
nM) . The increase in fluorescence upon substrate cleavage 
is 13 -fold. The reaction is linear as shown in figure 14B 

(fixed substrate concentration = 2 uM) . The detection 
limit was established as 1 nM for the high- throughput 
microplate assay and 520 pM for the HPLC-based assay. If 
a continuous (cuvette) assay is performed to establish 
initial rates for the enzymatic reaction, the lower limit 
for enzyme concentration is 80 nM, because of 
fluorescence quenching of the cleaved substrate at 
substrate concentrations higher than lOuM. 

DBPOSTTg 

Strains of E. coli DH1 - transformed using the 
plasmids pBac (1039-1226), pT7-7 (1039-1226), pT7-7 
(1039-1206), pT7-7 (1027-1206) and pT7-7 (1033-1206) 
coding, respectively, for the polypeptides with amino 
acid sequence SEQ ID NO:l, SEQ ID NO: 2, SEQ ID NO: 3, SEQ 
ID NO: 4 and SEQ ID NO: 5 - were deposited on 14 August 
1995 with The National Collections of Industrial and 
Marine Bacteria Ltd. (NCIMB) , Aberdeen, Scotland, U.K., 
with access numbers NCIMB 40761, NCIMB 40762, NCIMB 
40763, NCIMB 40764 and NCIMB 40765. 
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ABBREVIATIONS ANT) fiVMwnT-g Tflffl p ttt twp T^ y r 
Abu = 2-aminobutyric acid; CHAPS = 3- [ (3-colammide- 
propyl) -dimethyl-ammonium] -l-propansulphonate; DABCYL = 
4- [ [4 ' - (dimethylaminophenyl] azo] benzoic acid; 
Depsipeptide = a peptide where at least one peptide 
bond is replaced by the corresponding ester bond (the 
location (s) of the ester bond(s) within the molecule is 
usually indicated as V [COOJ- between the amino acid 
residues involved); DIEA = N,N-diisopropylethylamine; 
DIPC = N,N' -diisopropylcarbodiimide; DMAP = 4- 
dimethylaminopyridine; DMF = N,N-dimethylformmamide; DTT 
■ dithiothreitol ; EDANS =, 5-[(2'- 

aminoethyl) amino] naphthalenesulfonic acid; EDTA 
ethylendiammino-tetracetic acid; HOBt = n- 
hydroxybenzotriazole; HPLC = high-performance liquid 
chromatography; PyBOP = Benzotriazole-l-yl-oxy-tris- 
pyrrolidino-phosphonium hexafluorophosphate; RET 
resonance energy transfer; t-Bu = tertiary-butyl; TFA = 
trifluoroacetic acid; Trt (Trityl) = triphenylmethyl . 
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SEQUENCE LISTING 
GENERAL INFORMATION 

(i) APPLICANT: ISTITUTO DI RICERCHE DI BIOLOGIA 
MOLECOLARE P. ANGELETTI S.p.A. 

(ii) TITLE OF INVENTION: METHODOLOGY TO PRODUCE, 
PURIFY AND ASSAY POLYPEPTIDES WITH THE 
PROTEOLITIC ACTIVITY OF THE HCV NS3 PROTEASE 

(iii) NUMBER OF SEQUENCES: 47 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Societa Italiana Brevetti 

(B) STREET: Piazza di Pietra, 39 

(C) CITY: Rome 

(D) COUNTRY: Italy 

(E) POSTAL CODE: 1-00186 
15 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 3.5" 1.44 
MBYTES 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS Rev. 6.22 
_ (D) SOFTWARE: Microsoft Word 6.0 

(viii) ATTORNEY INFORMATION 

(A) NAME: DI CERBO, Mario (Dr.) 
(C) REFERENCE: RM/X88568/PC-DC 

(ix) TELECOMMUNICATION INFORMATION 
25 (A) TELEPHONE : 06/6785941 

(B) TELEFAX: 06/6794692 

(C) TELEX: 612287 ROPAT 
(1) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS 
30 (A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

35 (*i) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Met Gly Leu Leu Gly Cys He He Thr Ser Leu Thr Gly Arg Asp Lys 

1 5 10 is 



20 
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Asn Gin Val Glu Gly Glu Val Gin Val Val Ser Thr Ala Thr Gin Ser 

20 25 3 o 

Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr Val Tyr His Gly 

35 40 45 

Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro He Thr Gin Met 

50 55 60 

Tyr Thr Asn Val Asp Gin Asp Leu Val Gly Trp Gin Ala Pro Pro Gly 
65 70 75 80 

Ala Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu 

85 90 95 

Val Thr Arg His Ala Asp Val He Pro Val Arg Arg Arg Gly Asp Ser 

100 105 110 

Arg Gly Ser Leu Leu Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser 

115 120 125 

Ser Gly Gly Pro Leu Leu Cys Pro Ser Gly His Ala Val Gly He Phe 

130 135 140 

Arg Ala Ala Val Cys Thr Arg. Gly Val Ala Lys Ala Val Asp Phe Val 
X4S ISO 155 160 

Pro Val Glu Ser Met Glu Thr Thr Met Arg Ser Pro Val Phe Thr Asp 

165 170 175 

Asn Ser Ser Pro Pro Ala Val Pro Gin Ser Phe Gin Val Ala Leu 

180 185 190 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE:' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
Met Ala Arg lie Arg Ala LeU Leu Gly Cys He He Thr Ser Leu Thr 
1 5 10 15 

Gly Arg Asp Lys Asn Gin Val Glu Gly Glu Val Gin Val Val Ser Thr 

20 25 30 

Ala Thr Gin Ser Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr 

35 40 45 

Val Tyr His Gly Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro 
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50 55 go 

lie Thr Gin Met Tyr Thr Asn Val Asp Gin Asp Leu Val Gly Trp Gin 
65 70 75 so 

Ala Pro Pro Gly Ala Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser 

85 90 95 

Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val He Pro Val Arg Arg 

100 105 no 

Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro Val Ser Tyr 

115 120 125 

Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ser Gly His Ala 

130 135 140 

Val Gly He Phe Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala 
145 150 155 i 6 o 

Val Asp Phe Val Pro Val Glu Ser Met Glu Thr Thr Met Arg Ser Pro 
15 165 170 175 

Val Phe Thr Asp Asn Ser Ser Pro Pro Ala Val Pro Gin Ser Phe Gin 

18° 185 190 

Val Ala Leu 
195 

20 (3) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
Met Ala Arg He Arg Ala Leu Leu Gly Cys He He Thr Ser Leu Thr 
1 5 io is 

30 Gly Arg Asp Lys Asn Gin Val Glu Gly Glu Val Gin Val Val Ser Thr 

20 25 30 

Ala Thr Gin Ser Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr 

35 40 45 

Val Tyr His Gly Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro 
35 50 55 60 

He Thr Gin Met Tyr Thr Asn Val Asp Gin Asp Leu Val Gly Trp Gin 
65 7 0 75 80 
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Ala Pro Pro Gly Ala Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser 

85 so 95 

Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val He Pro Val Arg Arg 

100 105 110 

Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro Val Ser Tyr 

115 120 X25 

Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ser Gly His Ala 

130 135 140 

Val Gly He Phe Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala 
145 ISO 155 iso 

Val Asp Phe Val Pro Val Glu Ser Met Glu Thr Thr Met Arg 

165 170 
(4) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4_: 
Met Ala Pro He Thr Ala Tyr Ser Gin Gin Thr Arg Gly Leu Leu Gly 
1 5 io 15 

Cys He He Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu Gly 

20 25 30 

Glu Val Gin Val Val Ser Thr Ala Thr Gin Ser Phe Leu Ala Thr Cys 

35 40 45 

Val Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Ser Lys Thr 

50 55 60 

Leu Ala Gly Pro Lys Gly Pro He Thr Gin Met Tyr Thr Asn Val Asp 
S5 70 75 80 

Gin Asp Leu Val Gly Trp Gin Ala Pro Pro Gly Ala Arg Ser Leu Thr 

85 90 95 

Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala 

100 105 1X0 

Asp Val He Pro- Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu 

115 120 12 5 

Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu 
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130 135 140 

Leu Cys Pro Ser Gly His Ala Val Gly He Phe Arg Ala Ala Val Cys 
145 "0 155 160 

Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val Pro Val Glu Ser Met 
5 I" 170 175 

Glu Thr Thr Met Arg 

180 

(5) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS 
10 (A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Ser Gin Gin Thr Arg Gly Leu Leu Gly Cys He He Thr Ser Leu Thr 
1 5 io is 

Gly Arg Asp Lys Asn Gin Val Glu Gly Glu Val Gin Val Val Ser Thr 

20 25 30 

20 Ala Thr Gin Ser Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr 

35 40 45 

Val Tyr His Gly Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro 

50 55 60 

He Thr Gin Met Tyr Thr Asn Val Asp Gin Asp Leu Val Gly Trp Gin 
25 65 70 75 80 

Ala Pro Pro Gly Ala Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser 

85 90 95 

Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val He Pro Val Arg Arg 

100 105 no 

30 Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro Val Ser Tyr 

115 120 125 

Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ser Gly His Ala 

130 135 140 

Val Gly He Phe Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala - 
35 145 150 155 - 160 

Val Asp Phe Val Pro Val Glu Ser Met Glu Thr Thr Met Arg 

165 170 
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(6) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 6: 
Gly Ser Val Val He Val Gly Arg He He Leu Ser Gly Arg 

(7) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Fmoc-Tyr 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
Xaa Gin Glu Phe Asp Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr 

1 5 10 15 

He Glu Gin Gly 

20 

(8) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Tyr 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
Xaa Gin Glu Phe Asp Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr 
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(9) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Tyr 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
Xaa Gin Glu Phe Asp Glu Met Glu Glu Cys Ala Ser His Leu Pro 

1 5 10 15 

(10) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Tyr 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
Xaa Gin Glu Phe Asp Glu Met Glu Glu Cys Ala Ser His Leu 

(11) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Tyr 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11; 
Xaa Gin Glu Phe Asp Glu Met Glu Glu Cys Ala Ser His 
1 5 io 

(12) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Tyr 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
Xaa Gin Glu Phe Asp Glu Met Glu Glu Cys Ala Ser 
1 5 - io 

(13) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Tyr 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
Xaa Gin Glu Phe Asp Glu Met Glu Glu Cys Ala 

1 5 10 

(14) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 
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(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr 
1 5 io 

(15) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS . 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Glu 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
Xaa Met Glu Glu Cys Ala Ser His Leu Pro 

* 5 io 

(16) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Met 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
Xaa Glu Glu Cys Ala Ser His Leu 
1 5 

(17) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Glu 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
Xaa Cys Ala Ser His Leu Pro Tyx lie Glu Gin Gly 

1 5 io 

(18) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu 

1 5 io 

(19) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 

(D) FURTHER INFORMATION: 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
Asp Glu Met Glu Glu Cys Ala Ser His Leu 

1 5 io 

(20) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 
5 (B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Fmoc-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu 

1 5 io 

10 (21) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac -Asp 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Xaa Cys Ser Thr Pro Cys Ser Gly Ser Val 
1 5 10 

(22) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS 
25 (A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

30 (A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Glu 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
Xaa Asp Val Val Cys Cys Ser Met Ser Tyr 
35 l 5 10 

(23) INFORMATION FOR SEQ ID NO: 23: 
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(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Ala 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu 

1 5 10 

(24) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
Xaa Ala Met Glu Glu Cys Ala Ser His Leu 

1 5 10 

(25) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
Xaa Glu Ala Glu Glu Cys Ala Ser His Leu 
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1 5 10 

(26) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
Xaa Glu Met Ala Glu Cys Ala Ser His Leu 
1 5 io 

(27) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
Xaa Glu Met Glu Ala Cys Ala Ser His Leu 
1 5 io 

(28) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac -Asp 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
Xaa Glu Met Glu Glu Ala Ala Ser His Leu 
1 5 10 

(29) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Xaa Glu Met Glu Glu Cys Ala Ala His Leu 

1 5 10 

(30) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid_ 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
Xaa Glu Met Glu Glu Cys Ala Ser Ala Leu 

(31) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 
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(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
Xaa Glu Met Glu Glu Cys Ala Ser His Ala 

1 5 io 

(32) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Glu 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu 
1 5 io 

(33) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asn 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu 
1 5 io 

(34) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE : amino .acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



WO 97/08304 



PCT/IT96/001G3 

- 46 - 



(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Lys 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34; 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu 

1 5 10 

(35) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac -Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
Xaa Glu Met Glu Glu Cys Ser Ser His Leu 

1 - 5 xo 

(36) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
Xaa Glu Met Glu Glu Cys Phe Ser His Leu 

1 5 10 

(37) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 
5 (B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 6 

* 

10 <D) FURTHER INFORMATION: Xaa is Alg 

(alylglycine) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Xaa Glu Met Glu Glu Xaa Ala Ser His Leu 

1 5 io 

15 (38) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear _ 

(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
25 (ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 6 

(D) FURTHER INFORMATION: Xaa is Abu (a- 
amminobutyric acid) 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Xaa Glu Met Glu Glu Xaa Ala Ser His Leu 

1 5 xo 

(39) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS 
35 (A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
Xaa Glu Met Glu Glu Thr Ala Ser His Leu 

1 5 10 

(40) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
( ix) FEATURE : 

(A) NAME: Peptide 

(B) POSITION: 6 

(D) FURTHER INFORMATION: Xaa is Nva (norvaline) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
Xaa Glu Met Glu Glu Xaa Ala Ser His Leu 

(41) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
( ix) FEATURE : 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac -Asp 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
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Xaa Glu Met Glu Glu Val Ala Ser His Leu 

1 5 10 

(42) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac -Asp 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 6 

(D) FURTHER INFORMATION: Xaa is Abu (2- 
amminobutyric acid) ester bonded to the following 
residue 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 7 

(D) FURTHER INFORMATION: Xaa is Ala ester bonded 
to the adjacent preceding residue 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 13 

(D) FURTHER INFORMATION: Xaa is Lys (N*-Ac) -NH 2 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
Xaa Glu Met Glu Glu Xaa Xaa Ser His Leu Pro Tyr Xaa 
1 5 io 

(43) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 
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(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 6 

* 

(D) FURTHER INFORMATION: Xaa is Thr ester bonded 
to the adjacent following residue 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 7 

(D) FURTHER INFORMATION: Xaa is Ala ester bonded 
to the adjacent preceding residue 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 13 

(D) FURTHER INFORMATION: Xaa is Lys (Nf-Ac) -NH 2 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
Xaa Glu Met Glu Glu Xaa Xaa Ser His Leu Pro Tyr Xaa 

(44) INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 6 

(D) FURTHER INFORMATION: Xaa is Abu (2- 
amminobutyric acid) ester -bonded to the adjacent 
following residue 
(ix) FEATURE: 
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(A) NAME: Peptide 

(B) POSITION: 7 

(D) FURTHER INFORMATION: Xaa is Ala ester bonded 
to. the adjacent preceding residue 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 13 

(D) FURTHER INFORMATION: Xaa is Lys (N*- [ 3 H] ) - 
CH 3 CO) -NH 2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
Xaa Glu Met Glu Glu Xaa Xaa Ser His Leu Pro Tyr Xaa 
1 5 10 

(45) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac -Asp 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 3 

(D) FURTHER INFORMATION: Xaa is Asp (EDANS) 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 6 

(D) FURTHER INFORMATION: Xaa is Abu (2- 
amminobutyric acid) ester bonded to the following 
residue 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 7 

(D) FURTHER INFORMATION: Xaa is Ala ester bonded 
to the adjacent preceding residue 
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(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 9 

(D) FURTHER INFORMATION: Xaa is Lys (DABCYL) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
Xaa Glu Xaa Glu Glu Xaa Xaa Ser Xaa 
1 5 

(46) INFORMATION FOR SEQ ID NO: 46: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH : 9 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix> FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 2 

(D) FURTHER INFORMATION: Xaa is Asp (EDANS) 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 6 

(D) FURTHER INFORMATION: Xaa is Abu (2- 
amminobutyric acid) ester bonded to the following 
residue 
( ix) FEATURE : 

(A) NAME: Peptide 

(B) POSITION: 7 

(D) FURTHER INFORMATION: Xaa is Ala ester bonded 
to the adjacent preceding residue 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 9 

(D) FURTHER INFORMATION: Xaa is Lys (DABCYL) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
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Xaa Xaa Met Glu Glu Xaa Xaa Ser Xaa 
1 5 

(47) INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 13 

(D) FURTHER INFORMATION: Xaa is Lys-e-( 3 H)Ac 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr Xaa 
1 5 io 
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CLAIMS 

1. Isolated polypeptides, characterised in that they 
consist of an amino acid sequence chosen from the group 
comprising SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO: 3, SEQ ID 
NO: 4 and SEQ ID NO: 5, and in that they have the 
proteolytic activity of the HCV virus NS3 protein. 

2. Expression vectors, for the production of one of 
the polypeptides according to claim 1 in a host organism, 
comprising: 

a polynucleotide coding for one of said 
polypeptides ; 

functional regulation, transcription and 
translation sequences within said host organism, 
operatively bonded to said polynucleotide; and 

- optionally, a selection marker. 

3 . Host cell , either eukaryotic or prokaryotic, 
transformed using an expression vector according to claim 
2, capable of expressing the specific polypeptide coded 
in the chosen polynucleotide sequence. 

4. A process for preparing one of the polypeptides 
according to claim l, characterised by the fact that it 
comprises, in combination, the following operations: 

- transformation of a host cell, either eukaryotic 
or prokaryotic, using an expression vector containing a 
DNA sequence coding for a polypeptide chosen from the 
group of sequences indicated in SEQ ID NO:l, SEQ ID NO: 2, 
SEQ ID NO:3, SEQ ID NO:4 and SEQ ID NO:5; 

- expression of the desired DNA sequence to produce 
the chosen polypeptide; and 

- purification of the polypeptide thus obtained, 
avoiding resolubilisation protocols. 

5. Peptides, characterised in that they consist of 
an amino acid sequence chosen from the group of sequences 
indicated in SEQ ID NOS:7-12, 14, 18-20, 29-32, 35 and 
47, and by the fact that they can be used as substrates 
in a high- throughput assay of the in vitro . activity of 
polypeptides having HCV NS3 proteolytic activity. 
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6. Depsipeptides, characterised in that they consist 
of an amino acid sequence chosen from the group of 
sequences indicated in SEQ ID NOS:42-46, and by the fact 
that they can be used as substrates in a high- throughput 
assay of the in vitro activity of polypeptides having HCV 
NS3 proteolytic activity. 

7. A method for reproducing and effectively assaying 
in vitro the proteolytic activity of the HCV NS3 protein', 
characterised by the fact that the activity of the 
polypeptides according to claim 1 is reproduced and 
tested in a solution containing 30-70 mM Tris pH 6.5-8.5, 
3-30 mM dithiotreitol, 0.5-3% 3- [ (3-colammide-propyl) - 
dimethyl - ammonium] -l-propansulphonat e and 30-70% glycerol 
at temperatures of between 20 and 25 °C, in a high- 
throughput assay, using as substrates the peptides of 
claims 5 or the depsipeptides of claim 6. 

8 . The method for reproducing and effectively 
assaying in vitro the proteolytic activity of HCV NS3 
according to claim 7, in which the peptides of claim 5 
are used in a high- throughput assay at -concentrations of 
the polypeptides according to claim 1 of between 100 and 
200 nM. 

9 . The method for reproducing and effectively 
assaying in vitro the proteolytic activity of HCV NS3 
according to claim 7, in which the depsipeptides of 
claim 6 are used in a high- throughput assay at 
concentrations of the polypeptides according to claim 1 
of between 0.5 and 2 nM. 

10 . The method for reproducing and effectively 
assaying in vitro the proteolytic activity of HCV NS3 
according to claim 9, in which continuous monitoring of 
the proteolytic activity of the polypeptides of claim 1 
is carried out by use of depsipeptides chosen from the 
group of sequences represented by SEQ ID NO: 45 and SEQ 
ID NO i-46 as substrates , with internal f luorogenic 
quenching by "Resonance Energy Transfer" between a 
fluorescent donor, 5- [ (2 • -aminoethyl) amino] naphthalene- 



WO 97/08304 



- 56 - 



PCT/IT96/00163 



sulfonic acid (EDANS) , close to one end of the 
depsipeptide, and an acceptor group, 4-[[4»- 
(dimethylaminophenyl]azo] benzoic acid (DABCYL) close to 
the other end of the depsipeptide. 



WO 97/08304 



PCT/TT96/D0163 



1/13 



BamHI 




NS3 (1039-1226) 



lacZ(fi-gal) 



pBacNS3 

(1039-1226) 



C *# a 



recombination 
sequence 



Col El 





Sad 



SphI 



P ETL = promoter of the gene encoding the PCNA protein 
P PH = polyhedrin promoter 

Amp = gene encoding fi-Iactamase (Ampicillin resistance} 
LacZ (B-gal) = gene encoding fi-galactosidase 
Col El = pBR322 origin of replication 

Fig. 1 



WO 97/08304 



PCT/IT96/00163 



2/13 

pT7-7NS3(1039-1226) 




pT7-7NS3(i039-i206) 




rbs atg 




Smal 



NS3r 



1039-1206) 



^-lactamase 




-BamHI 
-Xbal 
-Sail 
-P$tl 

Hindlll 



Hindi 



010 = 010 promoter of bacteriophage 17 

rbs = Shine-Dalgarno ribosome binding sequence 

ATG = translation initiation site of the protein encoded by 
by gene 10 of bacteriophage T7 

B-Iactamase s gene encoding I$-Iactamse (Ampiciffin resistance) 



Col El = pBR322 origin of replication 



Fig.2a 



WO 97/08304 



PCT/IT96/00163 



3/13 



pT7-7NS3(io27 -1206) 



Ndsl 




NS3 (1027.1206) 



fi-lactamase 





' -BamHI 

- -Xbal 

- -Sail 

- -Pit! 

- -Hindlll 
•Clal 



HincU 



pT7-7NS3(io33-i206) 




NS3 ( 



1033-12W — 



rbs atg 



fi-lactamase 




- -BamHI 

- -Xbal 

- -Sail 

- -Pstl 

- -HlndUI 

- -Oal 



Hindi 



010 ss 010 promoter of bacteriophage T7 

rbs = Shine-Dalgarno ribosome binding sequence 

ATG s translation initiation site of the protein encoded by 
by gene 10 of bacteriophage 17 

B-lactamase = gene encoding fi-Iactamse (Ampicillin resistance) 



Col El = pBR322 origin of replication 



Fig. 2b 



WO 97/08304 



4/13 



PCT/IT9d/00163 



Glycerol dependence of NS3 activity 
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Ionic strength dependence of NS3 activity 
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